Sampling and Text Processing from Online Libraries (decomposer.py)


With the decomposer module, one can sample random documents (books, etc.) from Project Gutenberg and Archive.org and rearrange their texts in three ways: with Markov chain algorithms, with the cut-up technique, or by swapping instances of a part of speech between two texts.
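
To make the first two techniques concrete, here is a minimal, library-independent sketch in plain Python. It is not the decomposer module's API: the function names `markov_rearrange` and `cut_up`, their parameters, and the first-order word-level chain are all assumptions chosen for illustration, and the module's own implementation may differ.

```python
import random
from collections import defaultdict


def markov_rearrange(text, length=20, seed=None):
    """Generate `length` words by walking a first-order, word-level Markov chain.

    Hypothetical helper for illustration; not part of decomposer.py.
    """
    rng = random.Random(seed)
    words = text.split()
    if not words or length < 1:
        return ""
    # Map each word to the list of words observed immediately after it.
    followers = defaultdict(list)
    for current, following in zip(words, words[1:]):
        followers[current].append(following)
    word = rng.choice(words)
    output = [word]
    for _ in range(length - 1):
        candidates = followers.get(word)
        # Dead end (word seen only at the end of the text): restart anywhere.
        word = rng.choice(candidates) if candidates else rng.choice(words)
        output.append(word)
    return " ".join(output)


def cut_up(text, min_len=2, max_len=5, seed=None):
    """Slice the text into short runs of words and shuffle the runs.

    Hypothetical helper for illustration; not part of decomposer.py.
    """
    rng = random.Random(seed)
    words = text.split()
    fragments = []
    i = 0
    while i < len(words):
        size = rng.randint(min_len, max_len)
        fragments.append(" ".join(words[i:i + size]))
        i += size
    rng.shuffle(fragments)
    return fragments


sample = "the quick brown fox jumps over the lazy dog and the quick cat"
print(markov_rearrange(sample, length=10, seed=1))
print(cut_up(sample, seed=1))
```

The Markov sketch only ever emits words that occur in the input, and each step follows a word pair seen in the source unless it hits a dead end. The cut-up sketch keeps every word of the input and changes only the order of the fragments.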

The ParsedText class has several methods for random sampling.